NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Bao: Making Learned Query Optimization Practical

https://doi.org/10.1145/3448016.3452838

Marcus, Ryan; Negi, Parimarjan; Mao, Hongzi; Tatbul, Nesime; Alizadeh, Mohammad; Kraska, Tim (June 2021, SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data)

Full Text Available
Flow-loss: learning cardinality estimates that matter

https://doi.org/10.14778/3476249.3476259

Negi, Parimarjan; Marcus, Ryan; Kipf, Andreas; Mao, Hongzi; Tatbul, Nesime; Kraska, Tim; Alizadeh, Mohammad (July 2021, Proceedings of the VLDB Endowment)

Recently there has been significant interest in using machine learning to improve the accuracy of cardinality estimation. This work has focused on improving average estimation error, but not all estimates matter equally for downstream tasks like query optimization. Since learned models inevitably make mistakes, the goal should be to improve the estimates that make the biggest difference to an optimizer. We introduce a new loss function, Flow-Loss, for learning cardinality estimation models. Flow-Loss approximates the optimizer's cost model and search algorithm with analytical functions, which it uses to optimize explicitly for better query plans. At the heart of Flow-Loss is a reduction of query optimization to a flow routing problem on a certain "plan graph", in which different paths correspond to different query plans. To evaluate our approach, we introduce the Cardinality Estimation Benchmark (CEB) which contains the ground truth cardinalities for sub-plans of over 16 K queries from 21 templates with up to 15 joins. We show that across different architectures and databases, a model trained with Flow-Loss improves the plan costs and query runtimes despite having worse estimation accuracy than a model trained with Q-Error. When the test set queries closely match the training queries, models trained with both loss functions perform well. However, the Q-Error-trained model degrades significantly when evaluated on slightly different queries (e.g., similar but unseen query templates), while the Flow-Loss-trained model generalizes better to such situations, achieving 4 -- 8× better 99th percentile runtimes on unseen templates with the same model architecture and training data.
more » « less
Full Text Available
Interpreting Deep Learning-Based Networking Systems

https://doi.org/10.1145/3387514.3405859

Meng, Zili; Wang, Minhu; Bai, Jiasong; Xu, Mingwei; Mao, Hongzi; Hu, Hongxin (July 2020, SIGCOMM '20: Proceedings of the Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication)

Full Text Available
Learning scheduling algorithms for data processing clusters

https://doi.org/10.1145/3341302.3342080

Mao, Hongzi; Schwarzkopf, Malte; Venkatakrishnan, Shaileshh Bojja; Meng, Zili; Alizadeh, Mohammad (August 2019, ACM SIGCOMM 2019)

Full Text Available
Placeto: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning

Addanki, Ravichandra; Bojja Venkatakrishnan, Shaileshh; Gupta, Shreyan; Mao, Hongzi; Alizadeh, Mohammad (January 2019, Advances in Neural Information Processing Systems 32 (NIPS 2019))

Full Text Available
Neo: a learned query optimizer

https://doi.org/10.14778/3342263.3342644

Marcus, Ryan; Negi, Parimarjan; Mao, Hongzi; Zhang, Chi; Alizadeh, Mohammad; Kraska, Tim; Papaemmanouil, Olga; Tatbul, Nesime (July 2019, Proceedings of the VLDB Endowment)

Full Text Available
Park: An Open Platform for Learning-Augmented Computer Systems

Mao, Hongzi; Negi, Parimarjan; Narayan, Akshay; Wang, Hanrui; Yang, Jiacheng; Wang, Haonan; Marcus, Ryan; Addanki, Ravichandra; Khani Shirkoohi, Mehrdad; He, Songtao; et al (January 2019, Advances in Neural Information Processing Systems 32 (NIPS 2019))

Full Text Available

Search for: All records